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Abstract — The classical problem in network coding theory con- 
siders communication over multicast networks. Multiple trans- 
mitters send independent messages to multiple receivers which 
decode the same set of messages. In this work, computation 
over multicast networks is considered: each receiver decodes 
an identical function of the original messages. For a countably 
infinite class of two-transmitter two-receiver single-hop linear 
deterministic networks, the computing capacity is characterized 
for a linear function (modulo-2 sum) of Bernoulli sources. 
Inspired by the geometric concept of interference alignment 
in networks, a new achievable coding scheme called function 
alignment is introduced. A new converse theorem is established 
that is tighter than cut-set based and genie-aided bounds. Com- 
putation (vs. communication) over multicast networks requires 
additional analysis to account for multiple receivers sharing a 
network's computational resources. We also develop a network 
decomposition theorem which identifies elementary parallel sub- 
networks that can constitute an original network without loss of 
optimality. The decomposition theorem provides a conceptually- 
simpler algebraic proof of achievability that generalizes to L- 
transmitter L-receiver networks. 

Index Terms — Computing Capacity, Function Alignment, Net- 
work Decomposition Theorem 



I. Introduction 

Recently coding for computation in networks has received 
considerable attention with applications in sensor networks |[T) 
and cloud computing scenarios J2], UJ. In a sensor network, 
a fusion node may be interested in computing a relevant 
function of the measurements from various data nodes. In a 
cloud computing scenario, a client may download a function or 
part of the original source information that is distributed (e.g. 
using a maximum distance separable code) across multiple 
data nodes. 

The simplest setting for computation in networks consists 
of multiple sources transmitting information to a single re- 
ceiver which computes a function of the original sources. 
Appuswamy et al. study the fundamental limits of computation 
for linear and general target function classes for single- 
receiver networks 0). While limited progress has been made 
for general target functions, the problem of linear function 
computation in single-receiver networks has been solved in 
part due to a duality theorem establishing an equivalence 
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to the classical problem of communication over multicast 
networks |5). As a consequence, it was shown that the cut-set 
based bound is tight in the single-receiver case. 

Several results over the past decade have contributed to 
the understanding of classical communication in multicast 
networks in which the task is to transmit raw messages from 
transmitters to a set of receivers with identical message de- 
mands. The celebrated work of Ahlswede et al. [5 1 established 
that the cut-set bound is tight for multicast communication. 
Subsequent research developed practical linear network coding 
strategies ranging from random linear codes to deterministic 
polynomial-time code constructions J6], J7), JS], |9). The 
success of traditional multicast communication motivates us to 
explore the fundamental limits of multicasting a linear function 
in multi-receiver networks as a natural next step. For this open 
problem, some facts are known based on example networks: 
(a) Random codes are insufficient in achieving capacity limits, 
and structured codes achieve higher computation rates ifTOl ; (b) 
Linear codes are insufficient in general for computation over 
multi-receiver networks (cf. both IfTD and Ifl2l ) and non-linear 
codes may achieve higher computation rates. 

To make progress on the problem of multicasting a function 
in multi-receiver networks, we consider the simplest two- 
transmitter two-receiver network in which both receivers com- 
pute a linear function (modulo-2 sum) of two independent 
Bernoulli sources generated at the transmitters. Specifically, 
we consider the Avestimehr-Diggavi-Tse (ADT) deterministic 
single-hop network model IfTJl which captures superposition 
and broadcast properties of wireless Gaussian networks and is 
a generalization of networks of orthogonal links. We develop 
a new achievable coding scheme termed function alignment 
inspired by the concept of interference alignment 03), fl6l . 
We also derive a new converse theorem that is tighter than cut- 
set based bounds and genie-aided bounds. As a consequence of 
this capacity result, we find that unlike the single-receiver case, 
the cut-set based bound is not achieved due to competition 
for shared network resources that arise in satisfying function 
demands at multiple receivers. 

As a byproduct of our analysis, we develop a network 
decomposition theorem to identify elementary parallel sub- 
networks that can constitute an original network without 
loss of optimality for in-network computation. The network 
decomposition approach offers a conceptually simpler proof of 
achievability which we use to establish the linear computing 
capacity of L-transmitter L-receiver single-hop networks. In 

1 Niesen-Nazer- Whiting 1141 introduced a similar scheme in a different 
context called computation alignment. 
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Fig. 1, Two-transmitter two-receiver Avestimehr-Diggavi-Tse (ADT) deter- 
ministic network 



addition, the approach has potential for the design of structured 
computation codes in larger multi-hop networks. 

Related Work: In 021, 031, QS1, tne computing capac- 
ity for multicasting a sum of sources is explored for arbi- 
trary multiple-source multiple-destination networks. Rai and 
Dey llTTl proved that there exists a linear solvably equivalent 
sum-network for any multiple-unicast network and vice-versa. 
Ramamoorthy and Langberg fTBl characterized necessary and 
sufficient conditions for communicating sums of sources of 
two-source L-destination (or L-source two-destination) net- 
works, when the entropy of each source is limited by 1. On 
the other hand, our work considers sources without entropy 
constraints and establishes the exact capacity of an ADT 
multi-receiver network which is a generalization of traditional 
network coding models with orthogonal links. 

II. Model 

We focus on a two-transmitter two-receiver ADT determin- 
istic network. Section|VT]includes our results for i-transmitter 
L receiver networks. As shown in Fig. Q] this network is 
described by four integer parameters ?iy which indicates the 
number of signal bit levels from transmitter i (i = 1,2) to 
receiver j (j = 1,2). Let Xg 6 ¥\ be transmitter ts encoded 



signal where q 
given by 



m&Xij riij . The received signals are then 



Yi = G q - llll X x © G q - ll21 X 2l 
Y 2 = G q - ,ll2 X 1 © G q - ,l22 X 2 , 



(1) 



where G is the q-by-q shift matrix, i.e., [G]ij = l{i = j + 1} 
(1 < i < q; 1 < j < q), and operations are performed in F2. 

Each receiver wishes to compute modulo-2 sums of the 
two Bernoulli sources Sf and S 2 , generated at the two 
transmitters, with N uses of the network. Here we use 
shorthand notation to indicate the sequence up to K, e.g., 
Sf := (Six,-- - ,S 1K )- We assume that Sf and Sf are 
independent and identically distributed with Bern(i). Trans- 
mitter I uses its encoding function to map Sf to a length- 
N codeword . Receiver I uses a decoding function dg to 
estimate Sf © S 2 from its received signal Y e N . An error 
occurs whenever dg 7^ Sf © Sf . The average probabilities of 
error are given by X e = E [P(dg ^ Sf © Sf )J ,£ = 1, 2. 

We say that the computing rate i? C omp = 77 is achievable 
if there exists a family of codebooks and encoder/decoder 
functions such that the average decoding error probabilities 
of Ai and A2 go to zero as code length N tends to infinity. 
We will also need the notion of linear computing capacity 
C^ n mp , where we restrict both the encoders and the decoders 



to be linear mappings. In line with the standard network coding 
literature, when referring to the linear computing capacity, we 
will assume a zero-error framework rather than the framework 
of negligible error we use in the context of the regular 
computing capacity. 

We classify networks into two classes, depending on a 
reconstructability condition that will be specified in the sequel. 
The reconstructability turns out to be the key property that 
classifies networks. This will be clarified when we prove an 
upper bound on the computing capacity in Theorem Q] 

Definition 1: A network is said to be degenerate if none of 
G q ~ nij Xi can be reconstructed from (Yi,Y 2 ) for all A 
network is said to be non-degenerate if there exists such 
that G q ~ Uij Xi can be reconstructed from (Yi,Y 2 ). 

Lemma 1: A network is degenerate if and only if mi — 
n\ 2 = n 2 \ — n 22 . As a direct consequence, a network is non- 
degenerate if and only if nu — ni 2 ^ n 2 \ — 71,22. 

Proof: See Appendix lAl ■ 

III. Main Results 

Theorem 1 (Upper Bound on Computing Capacity): The 
computing capacity is upper-bounded by 

Ccomp < min{nn,ni2,n 2 2,n2i} ■ (2) 
For non-degenerate networks where nu — n\ 2 7^ 71,21 — n 22 , 
max^n, n 2 i ) + max(Ti 2 2, "12) n \ 



Proof: See Section lilLAl ■ 
We show the tightness of the above bounds for the following 
two cases: (a) degenerate networks; (b) symmetric networks 
characterized by two parameters of n := nu = n 22 and m := 

T112 = 7121. 

Theorem 2 (Degenerate Networks): For degenerate 
networks where nu — 7112 = ?i2i — n 22 , 



C 



= minjnii, 7ii2, 7i22, 77,21}- 



(4) 



Proof: The converse proof is immediate from TheoremQ] 
See Section IIII-BI for the achievability proof. ■ 
Theorem 3 (Symmetric Networks): For symmetric 
networks where 71 := nu = n 22 and m := 7112 = 7121, 



a 



comp 



mm 

71, 



{m, 



n, I max(m, 71)} , 7717^71; 



(5) 



Proof: The converse proof is immediate from Theorem Q] 
See Section [IV] for the achievability proof. ■ 

Our results are interpreted with a focus on symmetric 
networks. Specifically, it will be shown that our scheme out- 
performs the separation scheme where both receivers decode 
all of the sources and then compute modulo-2 sums of the 
sources. It will also be revealed that in contrast to a single- 
receiver function-unicasting case, the cut-set based bound is 
not tight when multicasting linear functions. 

For illustrative purpose, consider the normalized computing 
capacity as follows: 



Co 



min {a, |} , a < 1; 
1, a = 1, 



(6) 
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Fig. 2. Normalized computing capacity. Here n := tin = 7222 and m : = 
ni2 = Ti2i- The parameter o in x-axis captures a signal-strength similarity 
between m and n. 



where q = max(m, n) and a 



min(m,n) 
? 

Remark 1 (Comparison to Separation Scheme): The com- 
puting rate of the separation scheme can be derived from 
the multicast capacity. Note that the multicast capacity is the 
intersection of the two individual MAC capacities: the set C mu i t 
of (Ri,R 2 ) such that Ri < min(m,n), R 2 < min(m,n) and 
Ri + R 2 < max(m, n). Therefore, this gives 



(7) 



where C E 



sym 



Rsea Csvm . 1 

- = — - — = mm < a, — 

q q I '2 

sup{i? : (R, R) e Cmuit}- While this separa- 
tion approach provides the optimal strategy for < a < ^, 
it is suboptimal for the other regime i < a < 1. Note that 
for i < a < 1, more-than-half of signal levels at receivers 
naturally form the mod-sum function of our interest. It turns 
out that this natural matching can provide higher computing 
rates. Details will be explained in Section HVl □ 

Remark 2 (Comparison to a Single-Receiver Case): In a 
single-receiver case, the computing capacity achieves the 
cut-set based upper bound, which will be formally proven 
to be min(?7i, n) in the next section. On the other hand, 
the cut-set bound is not tight when multicasting a function. 
Notice the non-zero gap between the function-unicasting 
and function-multicasting capacities when | < a < 1 (see 
Fig. |21). This comes from the tension that arises in satisfying 
the same demand at multiple receivers. We will clarify this 
while presenting our achievability in Section [IV] □ 



A. Proof of Theorem [7] 

The proof of the bound (O is based on the standard cut-set 
argument. The main focus is to prove the second bound ([3]). 
Proof of ©: Starting with Fano's inequality, we get 



N{R, 
< I{S 



-e N )<I(S«®S?;Yf) 



comp 
K 



qK. v 1\ q 
°2 i 1 1 ' . 



0) 



I(S( 



A" 



(b) 



I(S{ 



K 



>S 2 K ;Y 1 N \S 2 K ) 

(c) 



H{Y»\S«,X») < Y,H(Y u \X 2l ) < Nn n 



where (a) follows from the fact that S 2 is independent of 
Sf- © S 2 ; (b) follows from the fact that X 2 is a function 
of S 2 ; (c) follows from the fact that conditioning reduces 
entropy. If -R C omp is achievable, then — > as N tends 
to infinity. So we get i? CO mp < nix- Similarly we can show 



that R c 



< mm{H(Y 2 \X 2 ),H(Yi\Xi),H(Y 2 \Xi)} < 



mm{ni 2 ,n 2 i,n 22 }. 

Proof of (|3): For non-degenerate networks, by defini- 
tion, there exists such that G q ~ Uij Xi can be recon- 
structed from (Yi, Y 2 ). Without loss of generality, assume that 
G«~" 12 Xl is a function of (Yi,Y 2 ). 

Starting with Fano's inequality, we get 



N(3R C 



K 



S?;Y 9 N ) 



I(S( 



K 



< [H(Y 1 N )-H(Y 1 N \S^®S 2 K )] 
+ \H{Y 2 N ) - H{Y 2 N \S? - aK 

< H(Yf) + H(Y 2 N ) 
-HiYf^Sfi 

( => H{Y?) + H{Y 2 N ) 
-HiYf^flS? < 

(c) — ^ ' H(Y 2 N ) 



SZ,Y»)]+I(S? ®S?;Y») 



S£)+H(T 12 X?\Sf) 



H(Y» 

-H(Yf,Y 2 N \S?> 

^ H(Yf) + H(Y 2 N ) 

- H{Y^ , Y 2 N , T 12 Xf|Sf © Sf) + H{T l2 X^\S^) 
< H(Yf) + H(Y 2 N ) 



(T 12 Xf|Sf © 5f ) + H(T 12 X?\S?) 



(<0 



H(Y{ 



H{Y 2 N ) - H(T 12 X?) - 

) + H(Y 2i )] 
< 2iV[max(nn, n 21 ) + ma,x(n 12 ,n 22 ) 



where (a) follows from the fact that conditioning reduces 
entropy; (b) follows from the fact that Sf is independent of 
Si © S 2 ; (c) follows from the fact that X 2 is a function 
of Sf and that T 12 := Ijy ® G«-™ 12 ; (d) follows from 
our hypothesis that G q ~ ni2 Xi is a function of (Yi,Y 2 ); (e) 
follows from the fact that is a function of S^ that is 
independent of Sf- © S 2 ; (/) follows from the fact that 
conditioning reduces entropy. This completes the proof. 

B. Proof of Theorem [2] 

Assume that tin — n\ 2 = n 2 \ — n 22 > 0. Then Y 2 is a 
degenerated version of Y\ : 

Y 2 = G q - ni2 Xi © G q - n22 X 2 

_ Q7-nn+n 2 l-«22^ i Q q - n22 X 2 



= G' 



122 Yi. 



This shows an equivalence to a single-receiver case which 
concerns receiver 2's demand only. Hence, in this case, 



Rrt 



min{rii2, 71.22}- Similarly for the other case of 
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Ni :— (1121 + ni2 — rcn) + — 2m — n 
N2 •— (?121 + ni2 — 7122) + — 2m — 71 



Fig. 3. [Case I: 1 < a < 2]: An achievable scheme for (m,n) = (3,5) 
and generalization to arbitrary values of (m, n). 

nn — "12 = fi2i ^ "-22 < 0, one can show that Y\ is a 
degenerated version of Y% and therefore a network becomes 
equivalent to a single-receiver case w.r.t receiver 1 where 
-Rcomp = min{nii,n 2 i}. 

IV. Proof of Theorem[3Jvia Geometric Approach 

By symmetry, focus on the case of to < n. The other case 
of to > n is a mirror image in which transmitters 1 and 2 are 
swapped. As mentioned in Remark Q] the separation scheme 
can achieve the computing capacity for < a < |. The case 
of a = 1 is a degenerate case where the channel forms the 
mod-sum function by nature at both receivers. In this case, 
uncoded transmission can yield i? C omp = n>. Hence, our focus 
is the following two non-degenerate cases. 

A. Case I: \ < a < | 

Let us explain achievability with the example (m, n) = 
(3,5) illustrated in Fig. [3] We will show that the cut-set 
bound of min(TO, n) = 3 can be achieved. First transmitter 1 
sends the bits (0,1,0,2,0,3) on the top 3(= m) levels. Observe 
that the 3rd level at receiver 1 marked with a green square 
is connected with transmitter l's upper to levels as well as 
transmitter 2's upper to levels. The idea is to exploit this 
overlapped level. Transmitter 2 sends 63 on the top level 
to achieve a 3 © 63 on the overlapped level at receiver 1. 
In an arbitrary case, the number of overlapped levels is 
Ni := ?ii2 + «2i — n>u = 2m— n. On the other hand, the bit 63 
is cleanly received at receiver 2 without being interfered with 
by (01, a,2, 03), since Ai+to < n in the regime of 5 < a < |. 
Similarly let A2 := ni2 + ?i2i — ft.22 = 2m — n be the number 
of levels at receiver 2 which are connected with transmitter 
l's upper to levels as well as transmitter 2's upper to levels. 
In this example, level 3 at receiver 2 is the overlapped level. 
Transmitter 2 then sends 61 on the 3rd level so as to achieve 
a\ © 61 on the level at receiver 2. This 61 is cleanly received 
at receiver 1 , since N2 + to < n in the regime of 5 < a < | . 

Finally notice that level 2 at transmitter 2 is vacant among 
the top to levels. In an arbitrary case, the number of these 
vacant levels is to— (N1 + N2). Transmitter 2 sends additional 
symbols (62 in this example) on the vacant to — (Ai + A2) 
levels. Obviously these symbols are cleanly received at both 
receivers. In summary, receiver 1 can compute ai©6i, 02062, 
and 0.3 © 63. In an arbitrary case, the total number of these 



computable bits is Ai+A2 + {to— (A1+A2)} = to. Similarly 
receiver 2 can compute m bits. Therefore, we can achieve 

^comp m. 

Remark 3 (Exploiting Channel Structure H10V , H19V ): In 
the regime of \ < a < |, more-than-half of signal levels 
at receivers naturally form the mod-sum function. This 
enables us to create a shared linear subspace. Note in the 
above example that at receiver 2, the symbols (01,61) share 
one-dimensional linear subspace spanned by [0,0,1,0,0]*, 
where [•]* indicates a transpose. This enables us to outperform 
the separation scheme where shared subspaces do not exist. 
□ 

Remark 4 ( Connection to Interference Alignment): Note 
that the linear subspace with respect to a\ is aligned with 
the subspace w.r.t b\. In this sense, it is an instance of the 
important concept of interference alignment 031, lfl6l which 
has shown great potential for a variety of applications such 
as interference channels lfl6l . cellular networks l20l . 12TI . 
distributed storage networks |22l . l23l . Il24ll and multiple 
unicast networks 1231 . l26l . But the distinction w.r.t our 
problem comes from the purpose of alignment. In our 
problem, the aim of alignment is to form a desired function 
while minimizing the signal subspace occupied by the source 
symbols. To highlight this purpose, we call it function 
alignment. □ 

B. [Case II: | < a < 1 ]: Example 

Unlike Case I, our achievability for this regime employs a 
vector-coding scheme. We first explain our achievability idea 
with the example (to, n) = (3, 4) illustrated in Fig. [4] We will 
then invoke a geometric insight which helps generalizing to 
arbitrary values of (to, n). The generalization will be explained 
in the next section. 

Our achievability idea is to alternate function alignment at 
both receivers. See Fig. |4] We first achieve function alignment 
di©&i at receiver 1. We next achieve 0,2062 at receiver 2. We 
repeat this until all of the resource levels are fully utilized. At 
the end of time 1, receiver 1 can then compute all of ai © 6i's 
(i = 1, 2, 3). However, receiver 2 can compute only ai © 61, 
a.2 © 62 and 63. Since we start favoring receiver 1, we end up 
with this asymmetry. 

In order to make it symmetric, we invoke the idea of vector 
coding. In time 2, we start by favoring receiver 2 instead and 
repeat the same procedure as before. We can then obtain a 
symmetric solution at the end of time 2. However, the solution 
is still inefficient. Note that b$ is missing at receiver 1, and 
similarly 03 is missing at receiver 2. To improve, we use 
another time slot. In time 3, we now have two purposes: (1) 
sending fresh source symbols; (2) delivering the b$ and 03 to 
receivers 1 and 2 respectively. We first multicast fresh symbols 
07 © 67 and as © 6g with alternating function alignment. Next 
transmitter 1 sends 03 (wanted by receiver 2) on the third 
level. But this transmission causes interference to 6s which 
was already received at receiver 1 . Fortunately we can resolve 
this conflict. Here the key observation is that 03 © 63 is 
already obtained at receiver 1 in time 1. Hence, transmitter 
2 sending 63 on top of 6§ in time 3, we can achieve the 
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Fig. 4. [Ca.se /// § < a < 1]: Alternating function alignment for (m, n) = (3, 4). 



function alignment 03 © 63 at receiver 1 . The 03 © 63 already 
received in time 1 can then be exploited as side information 
to decode 6g from 6s © 03 © 63. As a result, transmitter 1 
can deliver the 03 to receiver 2 without interfering with 6§ at 
receiver 1 . Similarly transmitter 2 can deliver the b§ to receiver 
1 without interfering with 07 at receiver 2. Both receivers can 
now compute ai © foj's for i = 1, • • • ,8 during 3 time slots, 
thus achieving i? C om P = §■ 

Geometric Interpretation: To aid generalization to 
arbitrary values of (m, n), we invoke geometric in- 
sights from the (3,4) example. In this example, v = 
[1, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0]* can be viewed as a beamforming 
vector for 01. Beamforming vector designs are closely associ- 
ated with function alignment. To achieve function alignment 
ai © bi at receiver 2, transmitter 2 designs its corresponding 
vector as Tv, where T indicates the 3-time-slot equivalent 
channel: T := I 3 ® G 4 " 3 = I 3 ® G. With this geometric 
viewpoint, we can interpret the (3, 4) example solution as in 
Fig.0 

Let a := (02, 04, ag, ae)* and a := (01,05,07,03)*; 
similarly b := (62,04,68,65)* an d b := (61,65,67,63)*. Let 
Vi be a 12-by-4 beamforming matrix w.r.t. a. Let V2 be a 
12-by-4 beamforming matrix w.r.t. b. According to the code 
construction in Fig. [4] we have 



Vi 



" 1 








" 




" 1 
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(8) 



To achieve function alignment a © b at receiver 2, transmitter 
2 sends b along with TVi. Similarly to achieve a © b 
at receiver 1, transmitter 1 sends a along with TV2. Re- 
ceiver 2 then gets V26 and (T 2 V2)a. One can verify that 
rank([Vi TV 2 T 2 Vi]) = rank([V 2 TV 1 T 2 V 2 ]) = 12. 
This enables both receivers to compute a © b and a © b. 



C. [Case II: | < a < 1]: Generalization 



We now provide a code construction of Vi and V2 for 
arbitrary values of (m, n). Let Mi be the column size of Vi, 
i.e., the number of symbols that form function alignment at 
receiver 2. Similarly, let M% be the column size of V2. In the 
previous (3,4) example, Mi = M2 = 4. Notice that i? com p = 
Ah+Ah j s ac hievable if the following matrices are full rank: 



Bi := [Vi, T'Vi, TV 2 ] G Fi; 
B 2 := [V 2 



3rtx(2Mi+M 2 ) 



T 2 V 2 , TVj] G F 3nx(Ml+2M2) . 



We choose appropriate values of (Mi, M2) such that Mi + 
M2 = 2n and thus can yield i? C omp = Considering the 
total dimension of the linear subspace at receiver 1, we get 
2Mi + M 2 < 3n. Similarly for receiver 2, we get M x +2M 2 < 
3n. This motivates us to choose Mi = A/2 = n. 

We construct (Vi, V2) such that Bi and B2 are full rank. 
The form of Vi and V2 in (H) inspires our construction in 
the general case. Note that the first three columns of Vi and 
V2 are the same, say V. Inspecting more examples, we could 
identify the dimension of V as 3n-by-3(n — m): 



V, 



[V Pi] G F 3nxn 
V 2 = [V P 2 ] G F 3,lXTl 



3n X 3(n — rn) 



where V G F!_ 
The form of ([8]) inspires: 



and Pi £ F 



3rax(3m-2n) 



V = I 3 © [e 



O) 



» 



(9) 



1,2. 



(10) 



(n) 

where e,- G F 2 l indicates the ith coordinate vector in an n- 
dimensional space. Note in (O that Pi and P2 bear a strong 
similarity: the (9th-12th) rows are identical; the (lst-4th) rows 
of P2 are the same as the (5th-8th) rows of Pi. Inspecting 
more examples, we could develop a construction: 



Pi = ef 



.(3) 



,(3) 



,(3) 



{[. 



fe (n) 

L e (n-m) + l 
(n) 

B 2(n-m) + l 

L e (n-m) + l 
(n) 
2(( 



■m) + l 



_(n) 
e 2m- 

»(n)l 
J m J 

(») 

e 2m- 
,(«)! 



[e 



(n) 
3(n- 



m) + l 



1} 



3 (n) 

J 3(n— m) + l 



(ID 



The following lemma shows that this code ensures the full 
rank of Bi and B2. This completes the proof. 
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(TV 2 )a\ Via 



V,bK 



(TVi)b 



M,=n 



Vi 



3/i 



Via (T 2 V!)b 




v. 



3)1 



+ M s = 3n 

(TV 2 )(a©b) 

[Afi + 2j|/ 2 = 3n 

TVi)(aeb) 



Fig. 5. Geometric interpretation of an achievable scheme. 



Lemma 2: 



rank [Vi, TV 2 , T 2 Vi] 
rank[V 2 , TVi, T 2 V 2 ] 



= 3n, 
= 3n. 



(12) 



Proof: See Appendix 151 



V. Proof of Theorem[3]via Network Decomposition 

In this section, we present a network decomposition the- 
orem that permits to decompose a network into elementary 
subnetworks. The decomposition theorem applies not only to 
the two-user network discussed so far, but directly extends 
to the L-user network, which will formally be introduced in 
Section [Vi] Using this theorem for the case of L = 2 users, we 
will provide an alternative conceptually-simpler achievability 
proof of Theorem|3]by coding separately over each elementary 
subnetwork. Interestingly, this coding strategy is sufficient to 
meet the converse bounds, and hence, to establish computation 
capacity, thus establishing a separation principle among the 
building blocks. This observation is somewhat surprising — in 
general interference channel problems, coding separately over 
parallel channels entails a significant loss in performance. 

For the general case of L users, we will evaluate the 
performance of this coding approach in Section |VT] and show 
that it matches the upper bound for linear coding strategies. 

Theorem 4 (Network Decomposition): For the L- 
transmitter L-receiver (to, n) network where m ^ n, 
the following network decompositions holdjl 

(1) For any k e Z + , 

(km, kn) = (to, n) k = (to, n) x (m, n) X . . . X (m, n). 

(2) (2m + 1, 2n + 1) = (m, n) x (m + 1, n + 1) 

(3) For the arbitrary (m, n) model, 



(to, n) 



(r,r + 1) 
(r + l,r) 



m—n— 



(13) 

a x (r + l,r + 2) a , m < n; 
a x (r + 2,r + l) a , m> n. 



-We use the symbol x for the concatenation of orthogonal models, just 





Fig. 6. A network decomposition example of an (m, n) = (2, 7) model. 
From I13K r = and a = 2; hence, the decomposition is given by (2, 7) = 
(0, l) 3 X (1,2) 2 . 



where 



iin{m, n} 

1 71 — 77l| 

a = min{m,n} mod \n 



(14) 



like in 



The proof is given in Appendix [C] Here we provide a 
proof idea with an (m, n, L) = (2, 7, 2) example, illustrated 
in Fig. [6] The idea is to use graph coloring with \n — m\ = 5 
colors, identified by integers {0,1,2,3,4}. At transmitter 1, 
assign to level 1 and level 6 (= 1 + \n — m|) the color 
(blue color in this example). Use exactly the same rule to 
color the levels of transmitter 2 and receivers 1 and 2. The 
blue-colored graph represents an independent graph of model 
(1, 2). Next we assign the color 1 (red color in this example) 
to level 2 and level 7 (= 2 + \n — m|), for all transmitters 
and receivers. We then obtain another independent graph of 
model (1,2) and are left with model (0,3). Obviously the 
model (0, 3) is decomposed into (0, l) 3 . Therefore, we get 
(2,7) = (1,2) 2 x (0,1) 3 . 

Remark 5: Unlike the L = 2 case, for L > 3, the case 
to < n is not symmetric with m > n. Nevertheless, the above 
symmetric decomposition holds even when L > 3. □ 

Remark 6: The separation principle among these decom- 
posed subnetworks is not generally true. It is well known 
that for parallel interference channels, optimal performance 
requires joint coding across orthogonal components. □ 
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x (l,2) m . Thus, using Lemma [3] the 
comp = • (n — 2m) + 1 • m = to. Next, 



Theorem |4] suggests that fundamental building blocks are of 
form (r, r + 1) or (7- + 1, r), that is, "gap-1" models. Hence, 
we focus on the computing rates of the "gap-1" models. 

Lemma 3 (L = 2): The following computing rates are 
achievable: 

(1) For the model (0, 1), i? comp = 0. 

(2) For the model (1,2), i? comp = 1. 

(3a) For the model (r, r + 1) with r > 2, i? comp = §0" + !)• 
(36) For the model (r + 1, r) with r > 2, i? comp = |(r + 1). 
(4) For the model (r, r), -R C omp = 

This lemma can be proved via the geometric approach in 
Section [IV] We give a short explicit proof in Appendix iDl 
showing that explicit codes for the (3, 4) and (4, 5) models 
(found, for example, via the method from Section HVb directly 
imply the general proof of the lemma. 

Achievability Proof of Theorem [3j By symmetry, we focus 
on the case of to < n. For the case of < a < ^, r = 
and a = m in (TBI : hence, the decomposition is given by 
(m,n) = (0,l) n - 2m 
computing rate is R { 

consider the case of | < a < |. Applying the decomposi 
tion ( [T3l l. we find that in this case, r — 1 and a = 2m — n: 
(m,n) = (i,2) 2 "- 3m x (2,3) 2m ~ n Thus, using Lemma 
the computing rate is i? C omp = l-(2n— 3m)+2-(2m— n) = m. 
Finally, consider the case of a > §. Applying the decompo- 
sition ( TT3b . we find that in this case, r > 2. So we get 

2 2 
-Rcomp = -(r + l)(n - m - a) + -(r + 2)a 

2 

= — {r(n — m) + a + (n — to)} 
3 

(a) 2 2 

= 3 l m + ( n ~ m )s = 3 n - 

where (a) is due to dl4Y This completes the proof. 

Remark 7: At first, it might seem that this proof is simpler 
than our arguments in Section [IV] However, we point out that 
proving Lemma [3] is not straightforward, and hence, that there 
is no clear ordering as to which proof is simpler. Both proofs 
carry different intuitions and insights into the structure of the 
problem. □ 

VI. L x L Symmetric Networks 

We consider an L(> 3) -transmitter L-receiver network 
where all of the L receivers want to compute a mod-2-sum of 
all of the Bernoulli sources generated at the transmitters. We 
consider a symmetric setting where the two integer parameters 
of (to, n) describe the network. Here n indicates the number 
of signal bit levels from transmitter I to receiver t; and to 
denotes the number of signal bit levels from transmitter I to 
receiver t). See Fig. [7] for an (m,n) = (3,4) example 
of the network. The received signal at receiver I is given by 

Y e = G q - n X e ©0G«- m X,-, 

for 1= 1,2,..., L. 

Theorem 5: The linear computing capacity is 

min {to, r 
n, 



(15) 




Cl 

ai e 61 



Fig. 7. Achievable scheme for the (r — 1, r) model where r = 2k. 

The computing capacity is upper-bounded by 

„ J min < to, n, 2 l-i max ( n J m ) \ 1 m n 'i 

L-comp S \ y > 



a 



comp 



, n, 5 max(n, m)| , m^n; 



Proof: See Section IVI-AI for the achievability proof and 
Section [VI-BI for the coverse proof under linear coding strate- 
gies. See Section IVI-CI for the information-theoretic upper 
bound. ■ 
Remark 8: In general networks, the linear capacity is often 
not equal to the capacity and non-linear codes may achieve 
higher rates lfl2l . In the limit of L —> 00, however, linear codes 
show the optimality. Note that our information-theoretic upper 
bound approaches the achievable rate as L tends to infinity, 
thus establishing the asymptotic computing capacity. □ 

A. Achievability Proof 

The idea is to combine the network decomposition in The- 
orem [4] and achievability proof for elementary subnetworks. 

Lemma 4 (L > 3): The following computing rates are 
achievable: 

(1) For the model (0, 1) or (1,0), i? comp = 0. 
(2a) For the model (r — 1, r) with r > 2, i? C omp = hr. 
(26) For the model (r, r — 1) with r > 2, i? comp = |r. 

(3) For the model (r, r), i? C om P = r. 

Proof: The items (1) and (3) are straightforward. For the 
(2a) model, we consider two cases: r = 2k and r = 2k + 1. 
Fig. [7] shows an achievable scheme when r = 2k = 2 ■ 2 and 
L = 3. Each transmitter uses odd-numbered levels to send k 
symbols. The special structure of symmetric networks allows 
each receiver to get clean symbols on odd-numbered levels 
while receiving partially-satisfied functions on even-numbered 
levels. For example, receiver 1 gets (01,02) on the first and 
third levels; (61 © Ci, 62 © C2) on the second and fourth levels. 
Note that two resource levels are consumed to compute one 
desired function. Therefore, this gives i? C omp = hr. Obviously 
this can be applied to an arbitrary value of L as well as the 
(26) model. 

Fig. [8] shows an achievable scheme for the case of r = 
2fc+l = 2-2 + 1 and L = 3. If we followed the same 
approach as in the case of r = 2k, each receiver would end 
up with having a resource hole in the last bottom level. In 
this example, receiver 1 would get (ai,6i ffi 01,02,62 © C2) 
on the 1st, 2nd, and 3rd levels, while the last bottom level is 
empty. In order to make an efficient resource utilization, we 
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h r 
'>:; t 


3 C 3 ©d4 
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63 
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3 c 3 ©64 


«i ' 


3C4 
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C3 
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a 3 © b 3 ©d 
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Fig. 8. Achievable scheme for the (r — 1, r) model where r = 2k + 1. 




Infeasible Patterns 



Case (A3) 

Tv, 



Fig. 9. Infeasible patterns of received signals for L = 3. Once perfect 
function alignment is achieved at receiver 3, any vectors cannot be aligned at 
the other receivers. 



again invoke the vector coding idea. At the end of time 1, 
each transmitter sends an additional symbol on the last even- 
numbered level. This transmission causes a conflict at each 
receiver. For example, 03 has a conflict with &2©C2- However, 
this can be resolved by using another time slot. In time 2, 
using the first level, each transmitter re-sends the symbol that 
was sent on the last even-numbered level in time 1 . From the 
second to last levels, we repeat the same procedure as in time 
1 to send fresh k symbols. Note that 0,3 is cleanly received at 
receiver 1 in time 2. This 03 can now be used to decode 62 ©C2, 
which was interfered with by 03 in time 1. Also the &3©C3 that 
was received at receiver 1 in time 1 can be used to decode 04 
which is interfered with by 63 © C3 on the second level in time 
2. Therefore, we can achieve i? C omp = k+ ^ +k = £. The same 



2k + 1) 



strategy can be applied to arbitrary values of (L, r 
as well as the (2b) model. ■ 
Using Theorem H] and Lemma 0] we can now prove the 
achievability. We focus on the case of m < n. The other 
case of m > n similarly follows. For < a < \, (TT~3T > 
gives r = and a = m, thus the decomposition is given by 
(m,n) = (0,l)"- 2m x (l,2) m . Therefore, using Lemma g] 

(n — 2m) + 1 • m = m. Next, 
< a < |. Using ( fl3t . we find that 
r = 1 and a = 2m — n, hence, the decomposition is given by 
(m,n) = (i ; 2) 2 "- 3m x (2,3) 2 ™-™. Using Lemmag] we can 

Finally, 



we can achieve i? CO mp = 
consider the case of \ 



achieve i? com p = 1 
consider the case of a > \. 

_ i„ 



(2n-3m) + |-(2m-n) 



From (fT3T l. we know that r > 2. 



So we get R c 



B. Converse Proof under Linear Coding Strategies 
Straightforward cut-set arguments give R c , 
min{m, n}; hence, it suffices to prove that i?J^ < 



< 



l comp 

max(m,fi) 
^comp — 2 

Consider any vector linear code over N uses of the network 

by Sf 



Denoting the vector of K successive bits of user 
this means that the transmitted signals can be written as 

K 



where v^j are the 
length N ma,x(m, n) to 

1 e F 2 v ' ' is 

K beamforming vectors 
that K computation bits 



"beamforming" vectors of 
be chosen optimally, and 
the matrix composed of the 
of transmitter t. Assuming 
are successfully decoded by 



linear decoding at all receivers, our aim is to prove that 

plin K_ <- max(m,n) 

n comp ■ TV — 2 

Our proof relies on a dimensionality argument, evaluated 
from the receivers' perspective. To formulate our argument, 
we define the following space: 



W it t = span{Tv v 



, TvH^Vfi, Tv £+M , . . . , Tv Ljl }. 



Intuitively, this is the space taken up by the ith computed bit 
at receiver £. First, we observe the following fact: 

Lemma 5: For every receiver I (I = 1, 2, . . . , L), the sub- 
spaces Wi.i, for i = 1, 2, . . . , K, must be linearly independent, 
i.e., for every i,j and £, we have Wi t H Wj : i = 0, where 
is the all-zero vector. 

Proof: This lemma can be proved by contradiction: sup- 
pose there exists j ' ^ i such that Wjj and Wi£ are not linearly 
independent subspaces. Then, it is not possible to guarantee 
that the computed bits i and j can be decoded without error. 

■ 

The following lemma relates the dimensionality of the 
various Wi t £ : 

Lemma 6: For any i, if there exists £ such that dim (Wi t) = 
1, then for all m 7^ £, we must have dim (Wj lTO ) > 2. 

Proof: See Appendix [E] ■ 
This lemma says that for any bit i for which function alignment 
is perfectly achieved at some receiver £ (i.e., dim(W / i i f) = 1), 
then for all other receivers, this same bit i must take up at least 
3 dimensions. For illustration, Fig. [9] shows some examples of 
these infeasible patterns when L = 3. 

We can restate this lemma in the following way. For any 
bit i, one of these two alternatives must apply: 

(B) : There exists £ such that dimfW^) = 1. Then, 
d\m(W l , m ) > 3 for all m ^ £. 

(C) : There does not exist £ such that dirr^W^) = 1. Then, 
dim(Wj )TO ) > 2 for all m. 

For illustration, Fig. [TOl shows examples. Cases (B1)-(B3) are 
the ones where the dimension for any other receiver (except 
the perfect-alignment receiver) is exactly 3. Cases (C1)-(C3) 
are the ones where the dimension is exactly 2 for all of the 
receivers. 

Let us introduce the set J of those indices i for which there 
exists £ such that dirr^Vl 7 ^) = 1. For every i G J, we have 
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Case (Bl) 




Case (B2) 


Case (B3) 
Vl 


Case (CI) 


Case (C2) 


/TV, 




V> v2 




Tvi 









Case (C3) 



Feasible Patterns 



Fig. 10. Feasible patterns of received signals for L = 3. Cases (B1)-(B3) are the ones where the dimension of the linear subspace for any other receiver 
(except for the perfect-alignment receiver) is exactly 3. Cases (C1)-(C3) are the ones where the dimension is exactly 2 for all of the receivers. 



(by case (B) above) 



dim(Wi, m ) > 1 + 3(L- 1) = 3L - 2. (16) 



Let us also denote the complement of the set J (in the set of 
integers between 1 and K) by J c . For every i <E J c , we have 
(by case (C) above) 



J2 dim (Wi, m ) > 2L. 



(17) 



Now, since K = \J\ + \J C \, and by definition, i?j£ mp 
K/N, we can write 



2LNR^ mp = 2L(\J\ + \J c \) 
< \J\(3L-2) + \J c \2L 

(a) L L 

^ EE dim (w*.™) + EE dim (^™) 

i£:Jm= 1 i£j c m-1 



(6) 



m— 1 i— 1 
L 



< JV max(m, n) = LN max(m, n), 



where (a) follows from Lemma [6] (rewritten as in Equations 
(fTSl and (fTTIi). and (&) follows because for each receiver £, the 
subspaces must be linearly independent (Lemma and 
their total dimensionality cannot exceed the total number of 
dimensions available at receiver t over N channel uses, which 
is N max(m, n). Therefore, we get i?comp — max ^ m ' n ^ . This 
concludes the proof. 



C. Proof of Upper Bound 

The following upper bound is a generalized version of the 
2x2 case bound (0. Starting with Fano's inequality, we get 



N((2L - l)f? comp - e N ) 

< E n® s?i yd + (L— 1)/(0 sr, Yl 



(a) 



< Y [H(Y e N ) - H(Y e N \ Sf , [If 
e=i 

+ (L-l)I{®Sl c ;Y 1 N ) 

< x: W) - U(y\ 5f ) + Y n® s ?-> y ^ 



£=1 



£=2 
L 



£ if (1/) - if I Sf ) + £ ii(T«Xf |&) 



£=2 



E^) 



- H(y, [TaX»]Z =2 \ Sf ) + ]T if(T«Xf |5e) 



(e) 



t=2 
L 



< J2 H ( Y i N ) H([T ei X^]f =2 )+J2H(TnXf\S e ) 



i=i 



1=2 



1=1 I £=2 1=2 ) 

L 

< Y H ( Y e N ) < NLmax{m, n) 
i=i 

where (a) follows from the fact that conditioning reduces en- 
tropy; (b) follows from non-negativity of mutual information, 
y := [Yf}^ =1 , and S £ := [Sf]f =1 \ Sf ; (c) follows from the 
fact that Sg's are mutually independent; (d) follows from the 
fact that X( is a function of y (see Claim[T]below); (e) follows 
from the fact that (5^, • ■ • ,Sf) is independent of 4 Sf; 
(/) follows from the fact that S¥'s are mutually independent; 

Claim 1: For m ^ n, Xg is a function of I = 

V" ,L. 
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Proof: Consider the case of to < n. From ( fT3T >. we get 

L L 

Y t = {I © (1 © L)G"" m } Xi 

i=l i=l 
L L £ 

0K, = (l9G"- m )0I,ffi{(L-l)G"-'"}0I,. 



i=2 i=2 

Straightforward computation gives 

L 

X 1 = A~ 1 ®Y i 



i=l 
L 



B 



1- © { (L - l)G"" m } \ A- 1 Yi 



i=2 



where A := {I © (1 © L)G n ~ m } and B := (I © G"~ m ), 
both of which are invertible since to ^ n. Hence, X\ is a 
function of p^j^Li- By symmetry, Xi is a function of [i^]^, 
£ = 2, ■ • • , L. Similarly we can show this for the case of 
to > n. ■ 

VII. Discussion 

A. Multi-hop Networks 

In |[T7ll . ifTTI . Ifl8l . function multicasting has been explored 
in the context of multi-hop networks. While some interesting 
relationship between sum-network and multiple-unicast net- 
works was found in ifTTIl . determining the computing capacity 
in general has been open. For two-source L-destination or 
L-source two-destination networks, the computing capacity 
was established only when the entropy of each source is 
constrained to be 1 |fl8ll . 

While in this work we remove the entropy constraint of 
sources, the network model we consider here is somewhat 
specialized and also restricted to a single-hop network. But 
we expect that our results will shed some lights on arbitrary 
multi-hop networks. One natural next step is exploiting the 
insights developed in this work, to characterize necessary and 
sufficient conditions of two-source two-destination multi-hop 
networks when the entropy of each source is limited by 2. 

B. Role of Feedback for Computation 

The role of feedback for function computation has initially 
been studied in ||27ll where it is shown that feedback can 
increase the computing rate. Interestingly the feedback gain is 
shown to be significant - qualitatively similar to the gain in the 
two-user Gaussian interference channel l28l . which revealed 
an unbounded feedback gain: the gap between nonfeedback 
and feedback capacities can be arbitrarily large as the signal- 
to-noise ratio of each link increases. However, the result 
of ||27ll relies on a separation approach that naturally comes 
in the course of characterizing feedback multicast capacity. 

Our future work is characterizing the feedback comput- 
ing capacity of the networks considered herein to explore 
whether we can do better than the separation approach. This 
will provide a deeper understanding of the feedback gain. 
Moreover it would be more interesting to explore this feed- 
back gain under more realistic scenarios where feedback is 



offered through rate-limited bit-piped links 11291 or through 
the corresponding backward communication network |30l . 
Furthermore, we are interested in extending to more general 
multi-hop networks RD . 

VIII. Conclusion 

We have established the computing capacity of a two- 
transmitter two-receiver ADT symmetric network where each 
receiver wishes to compute a modulo-2-sum function of two 
Bernoulli sources generated at the two transmitters. We also 
characterized the linear computing capacity of an L-transmitter 
i-receiver symmetric network. We developed a new achiev- 
able scheme and derive new upper bounds. Furthermore we 
established a network decomposition theorem that provides 
an alternative but conceptually-simpler achievability proof. 
We expect that the network-decomposition-based framework 
would play a role in extending to arbitrary multi-hop networks. 

Appendix A 
Proof of LemmaQ] 

A. Direct Part — > 

Without loss of generality, assume that nix ~ n i2 — nix — 
n 22 > 0. We can then get: 

Qn 2 i-n 2 2Y 1 © y 2 = (G 9 ~" 11+,l21-rl22 © G q ~ ni2 )Xi. 

For the obvious reason, X\ 6 F 9 , contains nontrivial values 
only on the top n\\ levels: 



Xt 



X t 



On 



e¥ q 



where Xi <E FJ 11 , Using this expression, we can rewrite the 
above as: 



G' 



2 Y 1 ®Y 2 



0„ 



'1— mi 

(G^-^ffiG";;-"-)*! 

where G nil indicates an nn-by-nn shift matrix. Since nn — 
ni2 = n 2 i - n 22 , we have G^~™ 22 © G^~" 12 = 0, and 
therefore any G q ~ nij X\ cannot be reconstructed. Similarly 
one can show that any G q ~ n2j X 2 cannot be reconstructed from 
(Y 1 ,Y 2 ) by considering G ,lll ~ ?ll2 Yi © Y 2 . Hence, a network 
is degenerate. 

B. Converse Part <— 

We will show that if nn — n\ 2 ^ n 2 \ —n 22 , then a network 
is non-degenerate. Consider the following four cases: 

Case I :ni2 < nn, n 21 < n 22 

Case II :n 12 > nn, n 21 > n 22 
Case III :ni2 < nn, ?i 2 i > n 22 
Case IV :ni2 > nn, n 21 < n 22 . 

Note that Case I and Case II are symmetric, so are Case III 
and Case IV. Hence, we focus on Case I and III. 
Case I (ni2 < n\\,n 2 \ < n 22 ): Consider 

Yi © G" 22 -" 21 Y2 = (G^™ 11 © G 9+ " 22 -" 21 -" 12 )Xi 



(Inn 



G™H+"22-n21-«12 S jX 1 
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Since mi - n u ^ n 21 - n 22 , G^+ n22 - n21 -™ 12 ^ I nil and 
therefore I nil © G™^ + " 22- ™ 21 "™ 12 is invertible. This implies 
that X\ is a function of (Yi,^). Hence, a network is non- 
degenerate. 

Case /// (77,12 < ran, n 2 \ > 77,22): First consider the case of 
77,21 — "-22 > "-li — "12- We then get 



QT121— H22J/ Q y 2 _ ^Q9 — ™ll+™21-«22 



lG «-«")Xi 



Since mi-ma ^n 2 _i-n 2 2, G^J"" 22 ^ G^J"" 12 . This im- 
plies that G^i"" 12 Xi is decodable and therefore G 9 ~ ni2 Xi 
is decodable. Hence, the network is non-degenerate. We now 
consider the other case of 77,21 — "-22 < "41 — "-12- We then 
get 

G" u "" 1! ii © Y2 = (G«-" 21+,lll - ni2 ©G«-" 22 )X 2 

(G^-"-ffiG^-" 22 )l 2 

Since mi-ma 7^7721-77,22, { ~™ 12 # ~" 22 . This im- 
plies that GH\- n ^X 2 is decodable and therefore G q ~ ,l22 X 2 
is decodable. Hence, the network is non-degenerate. 

Appendix B 
Proof of Lemma|2] 

Using (0, ( fTOb and ( fTTT i. we compute: 



TV = I 3 © [e 



(«) 

(n — m)+l 



T 2 V = I 3 © 
TPi = e 3 3) 



fe (n; 

^ e 2fn- 



2(n-m)+l 
J 2(n-m) + l 



» 1 
'2(n-ro)J 

e (,l) 1 

3(n— m) J 



^1 3(n— m)+l 



»1 



J 4(n-m) + l 



01} 



o ^ L 3(n— m) + l 
4(n — m) + l 



T^Pi = 

© 

TP 2 - 

ffie 



»1 



(3) - [e&L 



4 3) ®{[. 



2(ra-m) + l 
(«) 



Ml 



>) 

'4(n-m) + l 



01} 



T 2 P 2 = e 



(3) 



>) 

J 3(n-m) + l 



»1 



[e (n) 



4(n— m) + l 



With the Gaussian elimination method, we can show that 

span [Vi, TV 2 , T 2 Vi] 
= span [V,TV,T 2 V,Pi,T 2 Pi,TP 2 ] 
= span [I 3 © I„] . 

Hence, rank [Vi, TV 2 , T 2 Vi] = 3n. Similarly we can 
show that rank [V 2 , TVi, T 2 V 2 ] = 3n. 

Appendix C 
Proof of Theorem|4] 

For Part (1), consider the (km, kn) model. The proof 
uses graph coloring with k colors, identified by integers 
{0,1,..., k — 1}. At transmitter 1, assign to level p (for 



p = 1, 2, . . . , k max(m, 77)) the color (p — 1) mod k. Use 
exactly the same rule to color the vertices of receiver 1 as 
well as the transmitters and receivers of the remaining (L — 1) 
users. It is seen by inspection that each color represents an 
independent graph. Moreover, each color represents precisely 
an (m, 77) model. 

For Part (2), we use graph coloring with 2 colors. At 
all transmitters and receivers, assign one color to the even- 
numbered levels and the other color to the odd-numbered 
levels. By inspection, it can be verified that each color rep- 
resents an independent graph. Moreover, one color represents 
an (777, 77) model and the other represents an (m + 1, n + 1) 
model. 

For Part (3), we use graph coloring with \n — m\ colors, 
identified by integers {0,l,...,|ra — m| — l}.At transmitter 1, 
assign to level p (for p = 1, 2, . . . , max(m, 77)) the color (p — 
1) mod 1 77 — 777, 1 . Use exactly the same rule to color the levels 
of receiver 1 as well as the transmitters and receivers of the 
remaining (L— 1) users. It is seen by inspection that each color 
represents an independent graph. A tedious but straightforward 
calculation shows that of the resulting 77 — 777 1 independent 
graphs, there are a number of models (r + 1, r + 2) and 77 — 
777 — a number of models (r, r + 1), with the claimed values 
for r and a. 

Appendix D 
Proof of Lemma[3] 

We note that Items (1), (2) and (4) are obvious, and Item 
(3b) follows from Item (3a), since without loss of generality, 
for the multicast problem with L = 2 users considered here, 
the case (rra, 77) and the case (77, rra) are mirror images of each 
other in which the roles of transmitters 1 and 2 are swapped. 
We here provide an explicit proof of Item (3a), split into 
three cases. For notation, we will find it convenient to denote 
the vector of (binary) channel inputs used by transmitter 1 
as (Xi t x, X\ t 2i ■ ■ •)* an d me one usec l by transmitter 2 as 
(^2,lj ^2,2) • ■ •) • 

(i) The case r = 3£ ~ 1, for any integer I > 1. For 
I = 1 (hence, r = 2), an explicit code is as follows: 
^1,1 = ai,Xi t 2 = a 2 ,X ly3 = and X 2 ,i = &2,-^2,2 = 
bi,X 2 .3 = 0. It is straightforward to verify that both receivers 
can reconstruct a\ © fei and a 2 © b 2l hence, a computa- 
tion rate of 2 is attained. For the general case, we set 

Xl,3k-2 = 02fe-l)^l,3fc-l = a 2fc,^l,3fc = and X 2 ,3A;-2 = 

fr2fc,^2,3fc-i = b 2k -!,X 2i3k = 0, for k = 1,2, ...,£. Each 
receiver can reconstruct all 21 sums a k © b k and thus, the 
computation rate is 21 = |(r + 1). 

(ii) The case r — 3£, for any integer I > 1, derives easily 
once we have an explicit code for 1=1, i.e., for the (3,4) 
model. For the (3, 4) model, consider coding over 3 channel 
uses, which corresponds (by network decomposition) to the 
(9, 12) model. An explicit code can be found for example 
via the construction (O, ( [Tol l, (ITTb in Section |IV] leading to 8 
computations. For I > 2, we consider a vector linear code over 
3 channel uses, and thus, the (9£, 9£+3) model. For this model, 

we set Xi^k-g = a6fc-5:^l,9fc-7 = a 6fe-3: -X"l,9fc-6 = 
fl6fe-l, -Xl,9fc-5 = fl 6fc-4, -Xl,9fc-4 = &6fc-2j ^l,9fe-6 = 
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a 6fc,^i,9fc-2 = -X"i,9fc-i = ^i,9fc = and X 2 ^k-» = 

fl6fc-4, -^2,9fc-7 = fl6fc-2, ^2.9fc-6 = a6fc,^2,9fc-5 = 
&6fc-5j -^2.9fc-3 = a 6fc-3> -^2.9fc-6 = &6fc-X j -^2.9fc-2 = 

X 2 , 9fe -i = X 2 . 9fc = for fc = 1,2,..., I - 1. It is 
easily verified by inspection that each receiver can recover 
all 6(£ — 1) modulo sums. Additionally, we observe that this 
code does not involve or affect any of the last 12 positions 
at transmitters or receivers. These last 12 positions constitute 
exactly a (9, 12) model, for which we already know that an 
additional 8 computations are achievable. This gives a total of 
6(1 — 1) + 8 = 2(3£ + 1) computations. Thus, per channel use, 
the computation rate is |(3£+ 1) = |(r + 1). 

( in) The case r = 3£ + 1 , for any integer £ > 1 , derives 
easily once we have an explicit code for I = 1, i.e., for the 
(4, 5) model. For the (4, 5) model, consider coding over 3 
channel uses, which corresponds (by network decomposition) 
to the (12, 15) model. An explicit code can be found for 
example via the construction (0, ( tlO) . ( TTTb in Section [IV] 
leading to 10 computations. For I > 2,, we consider a vector 
linear code over 3 channel uses, and thus, the (9£ + 3, 9£ + 6) 
model. For this model, we set Xi^s = ci6k-5, ^i,9fc-7 = 

fl6fc-37 -^l,9fc-6 = &6fc-lj Xi t Qk-5 = <l6fe-4, Xl,9k—4 = 
a6fc-2, -X"l,9fe-6 — a 6fc,^l,9fc-2 = -Xl,9k-1 = -X"l,9fe = 
and X 2 ,9k-8 = fl6fc-4, -^2,9fc-7 = fl6fc-2, -<^2,9fc-6 = 
a6fc,^2,9fc-5 = a 6fc-5: -^2,9fc-3 = a 6k-3, X 2 ,gk-6 = 
a6fe-lj^2,9fc-2 = ^2,9fe-l = -^2.9fc = for k = 1, 2, . . . , I — 

1. It is easily verified by inspection that each receiver can 
recover all 6(^ — 1) modulo sums. Additionally, we observe 
that this code does not involve or affect any of the last 15 
positions at transmitters or receivers. These last 15 positions 
constitute exactly a (12, 15) model, for which we already know 
that an additional 10 computations are achievable. This gives 
a total of Q(£ - 1) + 10 = 2(3£ + 2) computations. Thus, per 
channel use, the computation rate is |(3£ + 2) = |(r + 1). 

Appendix E 
Proof of Lemma[6] 

We focus on the case of m < n. The other case similarly 
follows. By assumption, for the considered i, we have that 
there exists £ such that 

dim (W t j) = 1. 

To simplify notation, and without loss of generality, let us 
suppose that this holds for the last receiver, that is, for I = L. 
But this also trivially implies that 

span(v L ,j) = span(Tv M ) = • • • = span(Tv L _i, 4 ), (18) 

where T := Ijy <E> G n ~ m . Now, consider the subspace 
at receiver 1. We have that 

W iyl = span[v M , Tv 2 ,i, ••• ,Tv L „ M , Tv L;i ] 
= span[v M , Tv M , T 2 v M ] 

where the last equality is due to ( fl~8b . Since we assume that 
Set is decodable at all receivers, any individual symbol 



must appear at all receivers. This implies that 

dim(span(v M )) 
= dim(span(Tvi i i)) 
= dim(span(T 2 vi,j)) 
= 1. 

The key observation here is that for m ^ n, these subspaces 
are linearly independent: 

dim (span[v M , Tv M , T 2 v M ]) =3. 

We can apply the same argument for the other receivers to 
complete the proof. 
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